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ABSTRACT 

We present measurements of the clustering properties of bright [L > L#) z~4 Lyman 
Break Galaxies (LBGs) selected from the Oxford-Dartmouth Thirty Degree Survey 
(ODT). We describe techniques used to select and evaluate our candidates and cal- 
culate the angular correlation function which we find best fitted by a power law, 
u>{9) = A W 6~P with A w = 15.4 (with 9 in arcseconds), using a constrained slope of 
(3 = 0.8. Using a redshift distribution consistent with photometric models, we de- 
project this correlation function and find a comoving ro = 11.4~ti'gh~ o Mpc in a 
Q m = 0.3 flat A cosmology for Iab ^ 24.5. This corresponds to a linear bias value 
of b = 8.1^2 6 (assuming erg = 0.9). These data show a significantly larger ro and b 
than previous studies at z ~ 4. We interpret this as evidence that the brightest LBGs 
have a larger bias than fainter ones, indicating a strong luminosity dependence for 
the measured bias of an LBG sample. Comparing this against recent results in the 
literature at fainter (sub-L*) limiting magnitudes, and with simple models describing 
the relationship between LBGs and dark matter haloes, we discuss the implications on 
the implied environments and nature of LBGs. It seems that the brightest LBGs (in 
contrast with the majority sub-L* population), have clustering properties, and host 
dark matter halo masses, that are consistent with them being progenitors of the most 
massive galaxies today. 

Key words: surveys - galaxies: high-redshift - galaxies: evolution - galaxies: funda- 
mental parameters - galaxies: statistics 



1 INTRODUCTION 

The study of the universe at very high redshifts has ex- 
panded rapidly over the last decade. It is now possible to ob- 
serve galaxies over more than 90% of the age of the Universe. 
One of the most significant breakthroughs has been the dis- 
covery of a population of strongly-clustered, star forming 
galaxies at 2 .5 < z < 4.5, using the Lym an-break technique 
pioneered bv lSteidel fc Hamilton! 1^992). It is possible to se- 
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lect significant numbers of these galaxies using deep ground 
based multi-colour imaging (e.g.. lSteidel et alJll996ft . 

Lyman Break Galaxies (LBGs) represent the largest 
known population of high redshift objects, and therefore 
present a window into an important stage in the forma- 
tion of galaxies and large-scale structure. Much is now 
known about their physical c haracteristics jPettini et alJ 
l200lHShaplev et alJl200lll2003l) . They are somewhat dusty 
starburst galaxies with star formation rates in the range 10- 
few 100s Moyr -1 contributing a highly significant fraction of 
the stars formed at z ~ 2.5-5 iAdelberger fc Steidell EoOO). 

Comparison of their clustering properties and number 
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densities with semi-analytic models suggests that they are 
either relatively small galaxies, experiencing brief and infre- 
quent bursts of star formation that are primarily driven by 
galaxy-galaxy mergers ('The Collis ional Starburst Model'; 
ISomerville. Primack. fe FaberlEoOlh . or m very massive en- 
vironments with large reservoirs of gas, beco ming massive 
L* galaxies today f'The Massive Halo Model': Isteidel et alJ 
ll99fJ:lBaugh et alJll998ft . 



Although these scenarios have been tested against 
models of galaxy formation using the LBG angular 
correlation function on different scales, and as a func- 
tion of luminosity llBull ock. Wechsler, fc Somervilkj |2002t 
I Wechsler. Somerville. Bullock. Kolatt. Primack. Blumenthal. 
2001) the results are still somewhat dependent on the ob- 
servational sample used. Correlati on scale measurements 
range from rp = 2— 1 2 I hTan Mpc iAdelb ereer_et_aD Ll99d 
Giavalisco et alJ Il99sl Isteidel et al.1 Il998l: lArnouts et all 
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Figure 1. Filter curves for the full set of filters employed in the 
nnT Survey, convolved with the chip QE curve. 
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19991: IGiavalisco fc Dickinsonl l200lt lOuchi et al.l I200L 
Adelbereer et al.ll2003l : iFoucaud et al.ll2003l) . The range is 
likely due in part to cosmic variance because of relatively 
small sample sizes and may also reflect the luminosity 
dependence of clustering and the effects of small scale 
clustering. 



The data, (especially in the more studied z ~ 
3 population) do no t yet provide unequivocal answers. 
iPorciani fc Giavaliscol i2002f) find a lack of power in the 
angular correlation f unction on small scales sugge sting 
few LBG clos e pairs . IGiavalisco fc Dickinsonl feOOll) and 
IFoucaud et al.1 i200St) find fainter LBGs less strongly clus- 
tered than b righter ones. Conve rsely, the clustering results 
at z ~ 4 of lOuchi et alJ (120011) imply an excess of LBG 
close pairs from which they estimate an LBG merger rate. 
Using z ~ 3 cl ustering data provided by Adelberger et ah, 
several authors (ISomerville et al' l200lHWechsler et alj200ll 
iBullock et ail f2002) have shown that the collisional star- 
burst type models are marginally more favourable than mas- 
sive halo models. However it seems clear that current data 
sets are not extensive enough, and do not have accurate 
enough photometric redshifts in large samples, to provide 
very strong constraints on the relationship between LBGs 
and their host dark matter haloes. To begin to explore these 
questions it is useful to have very wide and deep multi-band 
imaging, to optimise the selection and redshift determina- 
tion for many thousands of LBG candidates, over a range in 
absolute magnitudes. The ODT is one such survey. 

The structure of this paper is as follows. In §|2] wc 
present the ODT survey, and its salient characteristics. 
The candidate selection techniques, and their limitations, 
are discussed in §|3] The general statistical properties of 
z ~ 4 LBGs are presented in §2J and the projected and 
spatial correlation functions are the topic of §|S| These re- 
sults are placed in a theoretical context, focusing on the 
nature of the environments in which LBGs exist in §|SJ the 
main conclusions finally being drawn in §Q Throughout this 
paper we use AB magnitudes, and assume a (r2 m ,r2A,^s) 
= (0.3, 0.7, 0.9) cosmology, unless otherwise stated, with 
Ho = 100/iiookms _1 Mpc _1 . 



2 THE OXFORD-DARTMOUTH THIRTY 
DEGREE SURVEY 

The Oxford-Dartmouth Thirty Degree Survey (ODT) is a 
deep-wide survey using the Wide Field Camera (WFC) on 
the 2.5-m Isaac Newton Telescope (INT) at La Palma. The 
survey employs six broad-band filters in UBVRi'Z, and 
the transmission curves are provided in Figure When 
completed the survey will cover ~30 deg 2 in BVRi'Z to 
J?5o- = 25.25 in four fields of 5 — 10 deg 2 each. The coordi- 
nates of these four fields are provided in Tabled Presently, 
approximately 25 deg 2 of the survey have been observed 
in BVRi' Z, although only data from the Andromeda field 
are analysed in detail here. The U data has only been ob- 
served in the best conditions, and currently covers ~1 deg 2 , 
all of which is in the Andromeda field. We are also cur- 
rently undertaking a if-band survey to K$ a m 18.5 using 
the 1.3-m McGraw -Hill Telescop e at the MDM Observa- 
tory on Kitt Peak JOldind 12002) . The three largest fields 
are also covered at radio frequencies with the VLA in A 
and D array at 1.4GHz, and the Lynx field is covered by 
low frequency radio observations with the VLA in A array 
at 74MHz and 330MHz. In addition, we are able to obtain 
redshifts for several sources in the survey which are part of 
the Texas-Oxf ord One-Thousand (TO OT) redshift survey of 
radio sources jHill fc Rawlingsll2003lh 

Whilst the ODT survey has a number of scientific aims, 
one of the principal projects is the detection of large num- 
bers of LBGs (at 2 ~4) over a wider field than previous stud- 
ies. The ODT survey reaches depths comparable to previous 
studies of LBGs (e.g lSteidel et aJjll999t lOuchi et ahll200ll) 
but over a significantly large r area of sky (c.f. ~900 square 
arcinm m lOuchi et alJl200ll) . The ODT therefore provides 
an ideal data set for the selection of a sample of the brightest 
LBGs. 

The fields discussed in this paper were observed with 
the 2.5-m INT during August 1998 and September 2000. 
Approximate 5a (isophotal) depths for the data along with 
exposure times are shown in Table |2] The median seeing for 
these observations is ~l.l". 

The reduction of CCD frames was carried out using the 
IRAF package and photomet r y cali brated for each frame by 
observing fields of lLandoltl fl992ft on photometric nights. 
The thinned chips in the INT WFC mean that fringing of 
sky lines becomes a problem in i' , Z, and to a lesser ex- 
tent, R data. Fringing was removed from each image in turn 
by using a fringe frame generated by the combination of 
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Field 


a(J2000) 


<5(J2000) 


I 


b 


Andromeda* 


00 18 24 


+34 52 00 


115 


-27 


Lynx*+ 


09 09 45 


+40 50 00 


181 


+42 


Hercules* 


16 39 30 


+45 24 00 


70 


+41 


Virgo 


13 40 00 


+02 30 00 


330 


+62 



Table 1. 3° X 3° field centres for the ODT survey in equatorial 
and Galactic coordinates. *Fields with 1.4GHz radio data. ^Field 
with 74MHz and 330MHz radio data. 



Filter 


Exposure 


5-cr 




Time 


Limit 


U 


6 x 1200s 


26.0 


B 


3 x 900s 


26.0 


V 


3 x 1000s 


25.5 


R 


3 x 1200s 


25.3 


i' 


3x 1100s 


24.5 


Z 


1x600s 


22.5 



Table 2. Exposure times and detection limits (AB at 5cr isopho- 
tal) for the different passbands used for the ODT survey. With the 
exception of the U images (which use 6), each pointing is broken 
up into three separate exposures with 5" offsets of the telescope. 
The detection limits are approximately equal to 5<r detections us- 
ing a 2"circular aperture and 3<r detections with a 3.3"circular 
aperture. 



individual images for each chip. This method was success- 
ful for the i! and R— band data, but the band data have 
more significant fringing, and little of the acquired data have 
been incorporated to date. Imag e detections were carrie d 
out using the SExtractor package feertin fc Arnoutsffi 996l. 
although we chose to use our own background subtraction 
method, due to significant background gradients present in 
some of our images. 

Each WFC pointing was arranged in a diagonal grid 
to cover each field, with several overlaps between adjacent 
pointings. This allows objects in overlaps to be matched and 
a common photomet ric zeropoint to be applied to each field, 
using the method of lGlazebrook et alJ <ll994tl . This method 
was used to obtain a common zeropoint for the V-band data 
which has the smallest scatter between chips and is not af- 
flicted by fringing from sky lines. 

In order to obtain consistent colours throughout the 
survey, photometry for other bands was corrected relative to 
the V-band data. This was done by minimising the deviation 
of the colo urs of ODT s tellar objects from the colours of 
stars from iPicklesI <ll998T) in the colour-colour plane. The 
ODT survey a nd our data reduc t ion pr ocess is described in 
more detail in iMacDonald et all i2004h . 



3 CANDIDATE SELECTION 

The LBG population of galaxies are traditionally selected 
by exploiting the break in their spectra shortwards of the 
Lyman-Limit (A res t = 912 A). The break is caused by an 
intrinsic drop in the spectra of the massive stars which dom- 
inate the LBG spectrum, and more importantly, the absorp- 
tion of photons shortwards of the Lyman-Limit by H i in the 
interstellar medium of the LBG. The break is further ac- 
centuated by absorption from Lyman-Limit Systems along 
the line-of-sight to the galaxy. By choosing appropriate filter 
systems to straddle this break, it becomes possible to select 
high-z galaxy candidates. 

The 'standard' method of LBG selection uses a 3-band 
filter set to isolate high-z galaxies in the colo ur-colour plane 
(e.g. ISteidel et all ll99St lOuchi et all 1200 if) . The selection 
region is denned using the predicted colours of synthetic 
galaxy spectra at high redshifts, and where spectroscopy is 
available, this can often be refined. However, for many sur- 
veys such as the ODT or the Hubble Deep Field (HDF), 
further information from extra filters may be available. By 
using this extra information we c an obtain photometric red - 
shifts for objects in a survey (e.g. lArnouts et alll999ll2002h . 
and select the high redshift objects. It is also possible, with 
multi-band data, to consider more than one filter combina- 
tion as we discuss in the next section. 

3.1 Models of High-z Galaxy Colours 

In order to define our selection criteria for LBG candidates 
we plot colours for model galaxies (see Fi g ures |^1 and [3J 
using extended IColeman. Wu. fc Weedmanl Jl98Ct) empiri- 
cal spectral templates convolved with the INT WFC fil- 
ter set. We consider two different regimes; selection using 
B — V/V — i' colours, and selection with B — R/R — i' 
colours. In additio n, we pl o t stel lar colours using the stel- 
lar SED library of lPicklesI (Il99gft . Galaxy colours are also 
adjusted for a bsorption from the IGM using the method of 
iMadaul il995l) . This attenuation of flux due to the opacity 
of the IGM proves to be a significant contributor to the fi- 
nal observed colours of high-z objects. In addition to these 
e mpirical template s , we a lso compare with the model grids 
of ISteve ns fc Lacy| l|200 l|l. which are based on the Pegase 
models of iFioc fcFtocca-Volmerangel il997T) and the stan- 
dard stars of lLandold il992f) ~ again using our INT filter set. 
These models contain SEDs with a range of starburst and 
star formation histories along with different amounts of dust. 
However they compare favourably with the simple model 
colours shown in Figures [2] and El The colours of z ~ 4 ob- 
jects are dominated by the effects of the Lyman break, and 
LBGs occupy essentially the same region of colour space in 
both models. 

In order to efficiently select high redshift objects, we 
define the following colour-colour cuts based upon the model 
galaxy colours shown in Figure [5] for B — V/V — i selection. 
For -1.0 < (V-i') < 0.41, 

(B-V)>1.2. (1) 
For 0.41 < (V - i') < 2.3, 

(B — V) > 0.95 x (V - i*) + 0.81. (2) 
Objects with (V-i') < -1.0 and (V-i') > 2.3 are excluded. 
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Figure 2. Model B — V versus V — i ' colo urs for galaxies to 
z ~ 5. Four SEDs from IColeman et alj ll98fJ) are used. No evo- 
lution is assumed, but the SEDs are modified to account for the 
affect s of attenuat ion due to intergalactic absorbers at high red - 
shift <Madaull995t) . The four SED types of lColeman et al.Hl98rJ) 
are shown as; a red dotted line (type E), a green short-broken 
line (type Sbc), a blue long-broken line (type Scd), and a ma- 
genta solid line (type Im). Also shown as black open st ars, are 
the co lours of Galactic stars, using the stellar library of iPicklesl 
1 1998). Important redshifts are labelled, most importantly, galax- 
ies with z > 3.5 (i.e. LBGs), and, on the Elliptical track, inter- 
mediate rcdshift (z = 0.2 — 1.1) objects, which are a prime source 
of contamination in a high redshift sample (see Section 13.31 . The 
final ODT selection window for selecting z ~ 4 objects is bounded 
by the solid black line. 



For B — R/R — i selection, we use the following colour cuts 

based on Figure 

For -1.0 < (R-i') < -0.07, 

{B-R)> 1.25. (3) 
For -0.07 < (R-i') < 0.9, 

(B - R) > 3.25 X (R - + 1.5. (4) 
Objects with (R—i') < —1.0 and (R—i 1 ) > 0.9 are excluded. 

3.2 Colour-Colour Selection 

Colours for a small subsection of the ODT survey are shown 
in Figures ^] and along with the LBG selection region in 
colour-colour space. We also require a detection in the V, 
R and i' bands, increasing the probability of only detecting 
real objects close to the faint limits of the survey. In addi- 
tion, only candidates fainter than il = 23.0 are considered in 
order to reduce contamination from lower red shift ellipticals 
with similar colours to LBGs JSteidel et alj ll999). Finally, 
candidates that have B > 25.5 or are not detected by SEx- 
tractor in B, are treated as non-detections and are given an 
limiting magnitude of B = 25.5. 

Given that we have 4-band information available to us, 
we impose the additional criterion that the object lies in the 
selection regions of both B — V/V — i' and B — RjR — i! (Fig- 
ures[I]and[KJ . Since the surface densities of objects are much 
higher just outside the selection region than just inside, low 
redshift objects are much more likely to be scattered into the 
selection region than vice-versa. This is somewhat reduced 
by requiring that an object is selected in both planes, and 
indeed we obtain candidate surface densities comparable to 
other searches for z ~ 4 objects (see Figure[5J. The contam- 
ination of the sample, and the effects of this restriction are 
discussed further in the next section. 




-0.5 0.5 1 1.5 

R-i 



Figure 3. As Figure |2] but using model B — R versus B. — i' 
colours. 



3.3 Contamination 

Although our selection criteria are fairly conservative, con- 
tamination of a non-spectroscopic sample such of this is ex- 
pected. The prime candidates for contamination are fore- 
ground Galactic stars, high redshift quasars, and most im- 
portantly, elliptical galaxies with 0.2 < z < 1.1. Spectro- 
scopic follo w-up of a sample se lected using similar selection 
criteria by ISteidel et al] lll999l) confirmed that about 20% 
of candidates were in fact elliptical galaxies at intermediate 
redshifts. The peak of the luminosity function for elliptical 
galaxies with z ~ 0.7 corresponds to galaxies with i' ~ 22.0 
iBell et al] 12004) . Having a bright cut-off of i = 23.0 for 
the sample should reduce contamination from these objects 
significantly, although a number of faint potential contami- 
nants are expected to remain. In the next Section, we con- 
sider two faint limits of i — 24.0 and i = 24.5, and so 
estimate contamination levels for both limits here. 

In Figures |5] and |HJ the elliptical track is noticeably 
more extended than for other types. Objects in the range 
0.2 < z < 1.1 are both close enough, and faint enough, to 
scatter into the LBG selection region. In order to estimate 
the potential level of contamination from these objects, we 
simulate their expected colours by combining their colours 
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Figure 4. B — V versus V — i' colours for ODT data. For clar- 
ity, only 5000 randomly chosen objects are shown. The resulting 
colour distribution is representative of the full data set. All objects 
with detections in V, R, and i' down to the limits of the survey 
fTablel^l are shown. Objects with no B detection are shown as red 
arrows. Stellar objects are designated by blue open stars, these 
were selected using SExtractor's stellarity parameter, and objects 
with stellarity > 0.8 and R < 23.0 are plotted. All other objects 
are shown as black circles. The stellar locus is clearly visible and 
this provides a useful cross-check of the photometric consistency 
between fields. The selection region determined from the models 
discussed in section f3.il is bounded by the black solid line. 
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Figure 5. As FigureKlbut using the B — R versus R — i' colours 
for a 5000 object subset from the ODT. 



from an SED with typical photometric errors. We then nor- 
malise their numbers with an appropriate luminosity func- 
tion, and test how many objects would meet our selection 
criteria. To ob tain the colour s of el liptical galaxies we use 
the SED from lColeman et alJ dl98fJ) . as plotted in Figures 
13 and The expected number of galaxies can then be ob- 
tained by integ rating the galaxy luminosity function using 
jGardnerlll99ai : 

n(m, z) AraAz — — — ^—(btm, z) dmdz, (5) 

47T dz 

where cj>(m, z) is the galaxy luminosity function, and dV/dz 
is the comoving volume element at redshift z. We use the 
i?-band luminosity function for elliptical galaxies from th e 
ESO-Sculptor survey at z ~ 0.5 ide Lapparent et alJl200al . 
The rest-frame R— band and redshift range covered by this 
luminosity function compares favourably to the objects of in- 
terest here. Little, or no evolution of the lu minosity functio n 
is expected in our redshift range of interest feell et all 2004'l. 
To simulate photometric errors, a random error is generated 
using a probability distribution describing the typical error 
as a function of magnitude. The error in each band is then 
added to the respective magnitude. If just one combination 
of colours is used (especially B — V/V — i'), then the con- 
tamination can be quite considerable (up to 27%, see Table 
|iU . However, by requiring that LBGs meet both selection 
criteria (B — V/V — i' and B — R/R — i'), the expected 
contamination from intermediate redshift galaxies can be 
reduced significantly. For i' < 24.0 we find a contamination 
of 4%, and for i' < 24.5 we obtain 3%. 

The ODT Andromeda field (i.e. the data discussed in 
this paper) has a fairly low Galactic latitude of / = —27°, 
and therefore Galactic stars must be considered as poten- 
tial contaminants. At the magnitudes considered here, the 
SExtractor star-galaxy classifier (in general) fails to distin- 
guish cleanly between stellar and galactic light profiles. In 
particular, red stars are prone to be found in the selection 
window due to extreme intrinsic colours, or because they lie 
close to the selection window and are scattered in because 
of photometric errors, or a combination of both. In order 
to estim ate the contamination due to stars, a m odel of the 
Galaxy IIRobin fc Crezj |l986: Ro bin et al] ll996l is used to 
generate a mock catalogue 1 of Galactic stars at the Galac- 
tic latitude and longitude of the ODT Andromeda field. 
This model contains elements from the thin and thick disk, 
along with the spheroi d of the Galaxy, and a simple model 
of Galactic extinction iRobin fc Crezd Fl986). The V— band 
number counts generated by this model we re compared with 
the 'Bahcall-Soneira' model llBahcalllll986fl . and with stellar 
number counts over the range 18 < V < 22 from the data 
itself, and found to be consistent. 

The model produces (without photometric errors) the 
expected mixture of stars at given Galactic coordinates in 
Johnson- Cousins U BV R C I C (Vega) magnitudes. These were 
then transformed to the ODT UBVRi' (AB) system and 
combined with a function to simulate a typical photometric 
error in each band, as done for the simulated galaxies. Our 
selection criteria were then applied to these stars. In total, 17 
stars (in one square degree) are found to match the selection 



1 http:/ /www. obs-besancon.fr/www/modele/ 
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Magnitude Range Stars Ellipticals Total 



BVi' BRi' Both BVi' BRA' Both BVi' BRi' Both 



23.0-24.0 22% 30% 8% 27% 4% 4% 49% 34% 12% 

23.0- 24.5 20% 28% 5% 15% 7% 3% 35% 35% 8% 



Table 3. Summary of expected contamination for stars and elliptical galaxies. The contamination fraction can be significantly reduced 
by using both selection criteria. 



Field" 


RA 6 


Dec c 


Seeing d 


Area e 


LBG 


B 


V 


R 


i' 


1012* 


00 11 38.47 


+35 39 36.7 


0.95 


675.7 


22 


25.8 


25.1 


25.1 


24.7 


1013 


00 13 49.66 


+35 45 47.8 


1.07 


460.7 


9 


25.7 


25.1 


24.6 


24.4 


1015 


00 16 02.41 


+35 51 51.3 


1.24 


424.7 


10 


25.5 


25.0 


25.0 


24.5 


1026 


00 10 29.80 


+35 03 39.3 


1.09 


538.7 


18 


25.5 


24.6 


24.8 


24.5 


1029 


00 18 19.46 


+35 34 47.4 


1.16 


458.8 


15 


25.5 


25.1 


25.0 


24.4 



Table 4. Data for ODT Andromeda fields used in this paper. These fields form a single contiguous field, (a) Field name, (b) RA of field 
centre, (hms) (c) Dec of field centre, (dms) (d) seeing (averaged over BVRi' bands), (e) Area in square arcminutes (after removal of bad 
areas of the chip, bright stars, satellite trails etc.). (f) Number of LBGs with i' < 24.0. The final four columns give the magnitude at 
which the total number counts turn over in each field. *Field used for the i < 24.5 LBG sample. 



criteria to il = 24.5, 7 of which have il < 24.0. Given the 
surface densities of LBG candidates (see Section 14.31 . this 
corresponds to a contamination of 8% for il < 24.0, and 
5% for i' < 24.5. We note that, again, these results increase 
significantly if only B — V/V — il or B — R/R — il selection 
is used (see Table |3J. 

Finally, usin g the quasar luminosity function at z ~ 4 
iFan et al.ll200lf) . we expect that even if all high-z quasars 
were to have colours consistent with LBGs, their expected 
number densities make any contamination from this popula- 
tion negligible. Our final contamination levels are therefore 
12% for i' < 24.0, and 8% for i' < 24.5. The contamina- 
tion results are summarised in Tabled] These results com- 
pare favourably with other LBG surveys (e.g. lOuchi et alJ 
2001), although the overall contamination level is reduced 
through the advantage of having two sets of selection crite- 
ria. Objects are less likely to be scattered into both selection 
windows through photometric errors than just one. We also 
note that our observed surface densities are fully consistent 
with other surveys (see Section [4.311 . implying similar levels 
of contamination and completeness. 

3.4 The ODT LBG Samples 

This paper uses the first 2 square degrees of ODT data in the 
Andromeda field, which was the first part of the ODT survey 
to be reduced and analysed. Unfortunately the quality and 
depth of the survey is variable, and we restrict the data used 
in this paper to those regions with the best seeing. In order 
to reduce contamination from any spurious signal caused 
by field-to-field variations we consider two samples of data, 
one to il < 24.5, and the other to i' < 24.0. These were 



Sample 


Limit («') 


Fields 


Number 


Total Area 


Bright 


24.0 


All 


74 


0.81 deg 2 


Faint 


24.5 


1012 


66 


676 arcmin 2 



Table 5. Summary of 'bright' and 'faint' LBG samples selected 
from the ODT. 



chosen to provide a contiguous area covered by all colours 
to the required depth, where we are confident of consistent 
completeness in all bands. When fields overlap, the overlap 
regions are split equally (in R.A.) between the two fields, 
and objects in the region will have photometry measured 
from one of the fields. 

The properties of the fields used (including depths) are 
summarised in Table^] and Table|K|summarises the proper- 
ties of the two selected samples. All five pointings are used in 
the bright (i' < 24.0) sample. These cover a region with good 
seeing and a completeness beyond the required il < 24.0. 
The distribution of these objects is shown in Figure [(J Af- 
ter removal of satellite trails, overlaps, diffraction spikes, 
and bright objects, the total area covered is 0.83 deg 2 . This 
sample consists of 74 objects. The 'faint' (i' < 24.5) sample, 
consists of just one INT WFC pointing covering 676 arcmin 2 
(Field 1012 in Table 0J. The distribution of these objects is 
shown in Figure |7| This sample is made up of 66 objects. 



© 0000 RAS, MNRAS 000, 000-000 



Clustering of Bright Lyman Break Galaxies in the ODT Survey 7 



LBG cands 23.0 < i < 24.0 




RA (Degrees) 



Figure 6. The distribution of our 'bright' sample of LBG candi- 
dates, with 23 < i' < 24. The outlines of the chips which make up 
the INT wide field camera pointings for this mosaic are shown. 



LBG cands 23 < i < 24.5 



v» ♦ . 




♦ 

♦ 


♦ 
♦ 

♦ ♦ 
♦ ♦ ♦ . 

* 
* 


♦ . ♦ 

♦ 

♦ ♦ ♦ 



3 2.8 2.6 

RA (Degrees) 



Figure 7. The distribution of our 'faint' sample of LBG candi- 
dates, with 23 < i' < 24.5. The outlines of the 4 chips which 
make up the INT wide field camera are shown. 



not currently forthcoming, and represents a major challenge 
for current instrumentation, even on 8— m class telescopes. 

However, in this paper, their most important use is in 
their statistical distribution, N(z), and the corresponding 
selection function. This is used for deprojecting the angular 
correlation function to retrieve the spatial correlation func- 
tion £(r), and for detailed studies of the effects of different 
dark matter halo occupation function parameterisations on 
the global statistics, but not for the selection of LBG can- 
didates. 



4.2 The Selection Function 

The selection function, as represented by the redshift distri- 
bution, is approximated by a simple analytic model based 
on the combined results from the c olour-colour diag r ams i n 
Figures [5] and [3] and the models of IStevens fc Lacvl i200ll) . 
The main use of our redshift distribution will be the de- 
projection of the angular correlation function to obtain ro 
(Section^, and we model redshift distributions using: 



N(z) 



v 7 ^ 



: exp 



2ai 



(6) 



where z is the mean, and a z is the standard deviation. 

We find that z — 4.0 and a z — 0.2 is a reasonable 
fiducial set of values that reproduces the photometrically- 
determined distributions. In order to test the dependence 
of ro on N(z), we also consider other selection functions 
with varying mean redshifts, z, and dispersions, a z , as well 
as modelling the redshift distribution as a simple top-hat 
function. These selection functions are shown in Figure |H| 
However, as we discuss in Section l5,2l at these redshifts, the 
deprojected ro we obtain is quite insensitive to the N(z) 
used. We do not propagate systematic errors from the un- 
certainty in N(z) in what follows. 



4.3 Surface and Space Densities 

The surface density of objects is calculated by selecting out 
the regions contaminated by bright stars, chip edge-effects, 
satellite trails, exceptionally poor seeing and strong remnant 
fringe effects. The sample selection described here is per- 
formed in the j'-band, requiring detection in the R and V 
bands as well. The measured surface densities, with Poisson 
uncertainties, are given in Table HJ and shown in Figure |3] 
In calculating these surface densities we only use those im- 
ages that are both deep enough and have good seeing as de- 
scribed in SectionESl Figure^lindicates that t he ODT LBG 
surface densities compa re well to other work djSteidel et alJ 
ll999UOuchi et all200ll) . We also calculate the space density 
using: 



4 GENERAL PROPERTIES OF LBGS 
4.1 Redshifts 

Although the primary selection criterion for LBG candidates 
is the effect of the integrated opacity by intergalactic Hi, 
redshifts are an important part of the analysis and interpre- 
tation of such samples. Obtaining spectroscopic redshifts for 
more than a very small fraction of our LBG candidates is 



,„ .dV , 
N(z)—dz 
dz 



(7) 



where E is the measured surface density in arcmin - , as 
shown in Figure [5] We use N(z) from Eq. |S| and dV(z) is 
the comoving differential volume element per square stera- 
dian for the cosmology of choice. The coefficient An = 
1.1818 x 10 7 arcmin 2 sr~ 2 . The calculated surface and space 
densities are presented in Table |H| In the next Section we 
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Figure 8. Normalised fiducial redshift distribution consistent 
with photometric models (bold solid line corresponding to a Gaus- 
sian with z = 4.0 and a z = 0.2). The other lines show the range of 
selection functions considered to test the sensitivity of the Limber 
deprojection to the assumed redshift distribution. The bold short- 
dashed line shows a 'tophat' function with centred on z = 4.0 
with z±0.5. The thin long dashed lines correspond to Gaussians 
with 2 = 3.5 and 4.5 with c z =0.2. Finally, the thin solid line 
corresponds to a much wider redshift distribution with cr z =0.4. 



magnitude 


range («') 


E (arcmin 2 ) 


n E (h 3 wo Mpc" 3 ) 


23.0 - 


23.5 


0.0065 ± 0.0041 


6.29 ± 5.30 x 10" 6 


23.5 - 


24.0 


0.0165 ± 0.0085 


1.60 ± 1.10 x 10~ 5 


24.0 - 


24.5 


0.0685 ± 0.0101 


6.63 ± 1.31 x 10~ 5 



Table 6. The surface and space densities for different i'-band 
selection magnitude ranges. Surface densities are per half magni- 
tude bin. 



also explore how sensitive the correlation function parame- 
ters are relative to the choice of N(z), and to the limiting 
magnitude or chosen magnitude range. 



5 CORRELATION FUNCTIONS 

5.1 Angular Correlation Function 

One of the more important results to come from LBG stud- 
ies has been the measure of their angular correlation func- 
tion, and a ssociated rp and bias parameter, 'b'. It has been 
shown (e.g IWechsler et alJlioOll: iBullock et al J 12002^ that 
this can be used to constrain the halo occupation function 
for LBGs (see Section |SJ , which in turn provides informa- 
tion on their typical masses an d likely evolutionary fate (e.g. 
iMoustakas fc Somerviliell2002l) . 



Figure 9. The surface density of colour-colour selected LBGs 
in the OPT Surve y (filled square s). Also shown are data from 
lOuchi et all 1200 Jl (open circles') . Isteidel et alJ Jl99Sl) (cros ses), 
and data from the Hubble Deep Field. lArnouts et alJ Il999fl (as- 
terisks). Surface densities are plotted per unit magnitude. 



The clustering of galaxies, as represented by the two- 
point correlation function, has been shown to be well ap- 
proximated by a power law fe.g |Peebleslll98tjD of the form 
f(r) = (r/rp) -7 . The angular projection of this also follows 
a power law of the form w(6) = A W Q~^ , where /3 = 7 — 1. 
We calculate the two-point angular correlation function, 
u}(0), for j'-band selec ted galaxies, using the estimator of 
lLandv fc SzalavH 19931) : 



DD{e) -w^2^mw 2 , 



RR{6) 



RR{6) 



(8) 



where DD(8), RR(6), and DR(6) are the numbers of 'data- 
data', 'random-random' and 'data-random' pairs, respec- 
tively, and Wi and Wi account for the numbers of data 
and random points (-/V ran and iVdata) use d to estimate the 
correlation function jRoche fc Ealeslll999]) . Here, 



Wi 



and 



W 2 = 



N ran r; 



iVdata(iVd 



(A^ran 



1)' 



(9) 



(10) 



In the data we remove bright stars, diffraction spikes, bad 
columns, etc., and constrain the random sample so that it 
follows the same geometrical constraints as the data. 

The Poisson error for the angular correlation function 
can be calculated using: 



(11) 



However, a w ($) is probably an underestimate of the actual 
error in ui(8) when the number of data points used is small 
jBaueh et alJll99fj) . In order to obtain a more appropriate 



© 0000 RAS, MNRAS 000, 000-000 



Clustering of Bright Lyman Break Galaxies in the ODT Survey 9 



estimate of the er rors on w(8), we compute bootstrap errors 
jLing et alJll986l) . 

If a sample contains n galaxies then a 'bootstrap' sam- 
ple can be created by drawing (with replacement), n galax- 
ies from the original galaxy sample. This process can be 
repeated N times and the correlation function, Wi(6), cal- 
culated for each of the N bootstrap samples (where i = 
1, 2, 3...-/V). The estimate of the error in w(8) is then given 
by: 



""boot 



\ 




Wi(6) 



N 



(12) 



The computed bootstrap errors are shown in Figures UHl and 
1111 and are used in what follows. 

Due to the fact that we are using a finite solid angle, 
we correct the estimate of u)(6) for the inte gral constraint 
jGroth fc Peebledll977tlRoche fc Ealeslll999l) : 



cj(6>i 2 )<5fii(«72 



(13) 



where Q is the solid angle. The integral constraint can be 
estimated (providing enough random points are used) with 
IC — A W B where B is given by: 



B 



Y,RR(s)e' 



J2RR(e) 

The angular correlation function then becomes, 

w(6) = w obs (e) + ic = A w e- 8 . 



(14) 



(15) 



The best fitting parameters A w and j3 can be found by find- 
ing the best (\ 2 minimisation 2 ) fit to the function w o bs(0) = 
A w (6^ /3 — B), for different values of /3, where B is recalcu- 
lated each time. The error on A w is computed as the value 
that gives A\ 2 = 1. 

Figure 1101 shows the angular correlation function for 
the fainter sample of 66 LBG candidates covering an area 
where we are confident there is comparable completeness 
to i' = 24.5. Some 90,000 random objects were used in this 
calculation. The angular correlation function for the 'bright' 
sample to i' — 24.0 is shown in Figure ITTI based on 74 LBG 
candidates. Here, 250,000 random objects were used to cal- 
culate the correlation function. Since the clustering strength 
of faint galaxies is significantly less than that discussed be- 
low for bright LBGs, faint galaxies in the same catalogue 
could also be used as an effective sample of 'random' objects. 
This provides a good test that there is no remnant structure 
due to varying completeness in the i'— band data. The galax- 
ies to be used as 'random' objects were selected to cover the 
same magnitude range as the LBGs (i.e. 23.0 < i' < 24.5 for 
the faint sample, and 23.0 < i' < 24.0 for the bright sam- 
ple). Similar correlation strengths to using a truly random 
sample were obtained. For the i' < 24.5 sample a slightly 
higher value of A w = 15.52 was measured (Xicd = 1-01). For 
i' < 24.0 a value of A w = 17.58 was obtained (x? e d = 0.89). 



2 The technique of \ 2 minimisation assumes independent errors. 
In fact, the errors on w(8) measurements are correlated. This 
complication is neglected here, but we note that if \ 2 is used to 
test goodness-of-fit, it is likely to be an underestimate. 



0=0.8 
A w =15.39 
IC 0.064 




Figure 10. The angular correlation function for 66 i' < 24.5 
LBGs (bottom panel). The fit is restricted to a fixed slope of 
P = 0.8. The top panel shows l + w(8). Data points are corrected 
for the integral constraint 



There is increasing evidence that faint samples of I- 
band selected galaxies (and therefore galaxies with a higher 
median redshift) have a shallower slope, /3, to the best fitting 
power law w(6) = A ro # _/3 than the often q uoted = 0.8 seen 
in clustering studies of loca l galaxies fe.g. fBrainerd fc Smaill 
Il998t iPostman et al]ll998l : iMcCracken et al.ll200llh In ~ 
dition, many measurements of LBG clusterin g also find a 
best fitting slope that is shallower than 0.8 fe.g lOuchi et all 
2001; Por ciani fc Giavaliscdl20 02'l . although few of the mea- 
surements are accurate enough to test whether or not this 
is significantly different to the local Universe, and so results 
are usually quoted with a fit constrained to f3 — 0.8. 

In the ODT data for i! < 24.5 the best fitting pa- 
rameters (with 6 in arcsec) are consistent with this (A w — 
7.43l5 2 4 , (3 — 0. 63^0 23)- However, there are not enough 
data to place a strong constraint on this slope. Therefore, 
to be consistent with other measures of LBG clustering in 
the literature, only results for a constrained slope of (3 — 0.8 
are considered in what follows. However, we note that LBG 
correlation functions with a shallower slope would have the 
general effect of increasing estimates of ro (in this case by 
~ 15%). This is something that will need to be looked into 
in more detail in the full ODT sample, and in other high 
quality data sets probing the high redshift Universe. 

Confining the slope of the angular correlation function 
to 0.8, we obtain an amplitude A w of 15.39 ± 4.3 arcsec ' 8 
for the faint i' < 24.5 sample. For the brighter sample a 
best-fitting amplitude of A w = 16.40 ± 8.6 arcsec 0,8 was 
calculated. The quoted errors in A w correspond to the er- 
ror on the best fit to the power law w(9) = A w 8~ ' 8 . We 
make no attempt here to correct the correlation function 
for contamination, in line with similar studies with similar 
contamination estimates. However we note that, in the case 
of maximum contamination our estimates of A w could be 
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Figure 11. The angular correlation function for 74 i' < 24.0 
LBGs (bottom panel). The fit is restricted to a fixed slope of 
P = 0.8. The top panel shows l + w(9). Data points are corrected 
for the integral constraint. 



too low by ~ 20% 3 The effect on the deprojected ro, as we 
discuss in the next section, will be smaller. 



5.2 Spatial Correlation Function 

Using our adopted redshift distribution, N(z) (Section l4.2H . 
in conjunction with the best fitting parameters to our angu- 
lar correlation functions, we estimate the corresponding cor- 
relation lengths for galaxies using th e Limber deprojection 
<Peebledll98ttlEMathiou et al]ll99llh The angular correla- 
tion function, w (8), and spatial correlation function, £(r), 
are related via: 



J~F(z)Dl-i(z)N(z) 2 g{z)dz 



where C is a constant, 

: r[( 7 -i)/2] 



G= V^- 



r(7/2) 



(16) 



(17) 



D$(z) is the angular diameter distance, g(z) is a function 
that describes the cosmology, 



g(z) = — [(1 + zf(l + Q, m z + Oa(1 + z)- 2 

c 



(18) 



and F(z) is the redshift dependence of £(r). 

Since the selection function for LBGs is very narrow 
(especially as a function of time), the function F(z) can 



be removed from the integral. This can be verified by us- 
ing both linear and non-linear clustering evolution for the 
function F(z) over our redshift distribu tion. We find this 
gives only negligible chang es in ro (c.f. lOuchi et alJl200lt 
iPorciani fc GiavaliscolEoOSl) . The comoving ro at redshift z 
is therefore given by: 



roO) = r [F(z)] 



(19) 



Using our fiducial redshift distribution and the angu- 
lar correlation function parameters derived in Section 15.11 
we obtain a spatial correlation length (at z ~ 4) of ro = 
11-41^9^00 M P C for the »' < 24 - 5 sample. For the i' < 24.0 
sample we obtain ro = 11. 8^4 o^ioo Mpc (see Table |7| for 
a summary of these results). These correlation lengths are 
considerably greater than other measuremen ts at z ~ 4 (e.g. 
lArnouts et*atlll999ll2(5o2l : IOuchi et al.ll200ll) and the impli- 
cations and context of this are discussed in Section 

In order to test the accuracy of these results we vary the 
N(z) distribution introduced in Section 14.21 over a reason- 
able range of values for z and a z and also consider a simple 
top-hat redshift distribution (see Figure |5J . We find that 
the calculated ro is only weakly dependent on the z used, 
and is only affected at the 10% level, even using a top-hat 
distribution over 3.5 < z < 4.5. However, significant broad- 
ening of the selection function by increasing a z to 0.4 would 
increase ro by up to 40%. It should also be noted that nar- 
rower selection function would lead to smaller values of ro. 
A Gaussian selection function with a z reduced to 0.1 would 
lead to an ro that is ~20% smaller. If we consider the maxi- 
mum final contamination given in Table |3] and assume that 
contaminants are unclustered (see Section fe-H . we find that 
ro would increase by 15%. Note that our quoted results con- 
tain the error derived from the best fitting power law only. 
In line with other LBG studies, the effects of systematic 
uncertainties are not considered further. 



5.3 Linear Bias 

The spatial correlation length ro can be used to calculate 
the linear bias. Several definitions of linear bias appear in 
the literature; here we adopt the definition based on the 
variation in mass over 8 h~ x Mpc spheres (as) and bias is 
defined at our fiducial median redshift (z — 4.0). 

Using our correlation function measurements, and as- 
suming a ACDM cosmology with an n = 1 power spectrum 
of initial mass fluctuations, normalised to as = 0.9, we use 
linear theory (neglecting any non-linear corrections which 
are expected to be small at 2 ~ 4) to determine the linear 
bias, 



(20) 



0"dm 

all measured at a scale of r = 8 h^ 00 Mpc. 

In linear theory, the dark-matter variance, a dm, at red- 
shift z = can be calculated for an arbitrary mass scale M, 
which is equivalent to a physical scale R, via: 



3 If the contaminants are unclustered then A w would be reduced 
by a factor (1 — /) 2 , where / if the contamination fraction. In 
the case where contaminants are clustered, and their clustering 
amplitude is smaller than that for LBGs, the reduction in J\-w 
would be less. 



R 



_3_ M 

4-7T p b 



(21) 



where the density, pb = £l m X (2.78 x 10 11 ) /ifoo- The dark- 
matter variance, (Tdm( z = 0), can then be calculated for a 
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given physical scale R, using the relation between mass-scale 
and variance. At redshift z, the dark-matter variance is given 
by: 



(Tdm{z) = CTdm(z = 0) X Dn n (z), 



(22) 



where Du n {z) is the linear growth factor at the redshift of 
interest; at redshift 4, Du n (z = 4) = 0.25569. The <r g is 
calculated (assuming £ is described by a power law) as: 



■h 



m 



where, 



J 2 = 



72 



(23) 



(24) 



(3 - 7 ) (4 - 7 ) (6-7)2T' 

jPeebledll98ch . See ISomerville et al.1 J2004T) for further de- 
tails and discussion. 

For i' < 24.5 we obtain b = 8.1±{]l, and for i' < 24.0, 
b = 8.4l2 g- The clustering and bias results are summarised 
in Table 



12 



Jc B - 



6 - 



4 - 



it 



it 



o 



24 26 28 

limiting magnitude (r) 



6 DISCUSSION 

In hierarchical models of structure formation, structure 
forms by the magnification of initial density fluctuations 
by gravitational instability. A key feature is that virialised 
structures (or dark matter haloes) should have clustering 
properties that differ from that of the overall mass distribu- 
tion, withmoremassive haloes being more strongly clustered 
(e.g. lKaiserlll984l) . If galaxies and clusters form when bary- 
onic material falls into the potential wells of the dark matter 
haloes then a correlation between galaxy (and hosting halo) 
mass and clustering strength (and therefore bias) would be 
expected. 

An important measurement in studies of LBGs has 
been that of their clustering properties. The strong spa- 
tial clustering and surface densities exhibited by LBGs 
at z ~ 3 appears to be consistent with biased galaxy 
formation, implying an association between LBGs and 
fairly massive dark matter haloes, even if there are sev- 
eral galaxies per dark matter halo fe.g. iGiavalisco et alJ 

[ 19981: IGiavalisco fc Dickinson! EoOll: ISomerville et al.ll200lF 
Wechsler et al.l200lHBullock et all2002l) . The measurement 
of the clustering properties of z~4 LBGs from the ODT sur- 
vey provides values of ro and b that are significantly larger 
than previous measurements of these param eters at this red- 
shift (lOuchi et all200 J: lArnouts et al|l998t) , and of U-band 
dropouts at z ~ 3 |Giavan& c^^t^I .1 119981: lArnouts et al 



W9£ ; IGiavalisco fc DkkilisorTT27)Mi r Tporcrar^ 

2002). However, the data presented here are from a sam- 



ple that is much brighter than other measurements of the 
correlation function at z ~ 4. We are therefore able, by com- 
bining these shallower wide-field data with deeper pencil- 
beam surveys such as the Hubble and Subaru deep fields, 
to compare with th e luminosity depende nt bias detected in 
the loc al Universe ([ Norberg et al.| [2 002^1 and at high red- 
shifts ij Giwalisco fc DickinsonTT200ll : iFoucaud et al.l l2003t 
lOuchi et^nTl2004T) . and test models of biased galaxy forma- 
tion. Using our definition of bias (Equation 1201 we calculate 
bias paramete rs for the z ~ 4 S ubaru deep field cluster- 
ing results of lOuchi et al.l i200lf) . We obtain b = 2.6 for 
i' < 25.5 and b = 2.2 for i' < 26.0. We also consider the 



Figure 12. The relationship between the limiting i'— band mag- 
nitude of a survey and the measured bias (using the trg definition 
as discussed in t he text). ODT data are represented by large open 
stars. Data from lOuchi et all l200ll) are shown as open penta gons. 
Filled squares correspond to data fixmrj0uc3r^t_aEL( 2004). The 
rilled circle represents results frc«rj^rnojit^^aL^ ^99flh . and the 
open circle shows results from lArnouts et al ■ U20021) . 



more recent results of lOuchi et al.l (l2004f) who present clus- 
tering results for several differently defined LBG samples. 
The three samples using data up to a given magnitude limit 
(with b = 5.3 for i' < 24.8, b = 3.5 for i' < 25.3, and 
b — 2.9 for i' < 26.0) are used here. When these data are 
considered alongside the data presented in this paper, there 
appears to be a clear trend between the depth of a sample 
and LBG bias. At 2 ~ 4, our faintest magnitude limit of 
i' ~ 24.5 corresponds to a rest-frame absolute magnitude of 
M 17 oo w -20.5 at A r CSt = 1700 A** Emp loying the z ~ 4 lu- 
minosity function of Stei del et alJ lll999l) , the faintest galax- 
ies in our sample are then L1700 ~ 1.5Z/« (the brig htest 
being L1700 sa 6.1Z/« ) , whe reas the lOuchi et alJ i200lf) and 
lArnouts et alJ J1999L I2002T) samples are, on the whole, sub- 
L„. The luminosity /bias results are summarised in Table |7| 
and are plotted in Figure WI\ 

One interpretation of luminosity dependent bias 
could be a tight correlation between dark matter halo 
mass and star formation rate (i.e. UV l uminosity; 
IGiavalisco fc Dickinson! 1200 ll : ISteidel et alJll998T> . The ob- 
served i'-band magnitudes measured here correspond to a 
rest-frame of A ~ 1500 A. At this wavelength we are mea- 
suring the UV luminosity which is a good tracer of the star 
formation rate in the galaxy. We can therefore infer a link 
between halo mass and star formation rate. However we em- 
phasise that the rest-frame UV luminosity is more indicative 
of the instantaneous star formation rate, rather than total 
underlying light (and therefore stellar mass). 



4 To calculate the ap propriate fc-correction s, we have used the 



rest-frame UV SED of lShaplev et al] <20Q. 
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Reference Sample A w r (/i 10 1 Mpc) b n (h\ w Mpc" 3 ) 

ODT (this paper) i' < 24.0 16.40 H-StJJ S- 4 !!" (2.23 ± 0.25) X 10" 5 

ODT (this paper) i' < 24.5 15.39 H-Cti'g ^ti^ (8.86 ± 0.49) x 10~ 5 

Ouchi et al. (2001) i' < 25.5 0.97 3.2±;J;° 2.6+° g (1.71 ± 0.07) x 10~ 3 

Ouchi et al. (2001) i> < 26.0 0.71 2.7l°'g 2.2t°j (3.72 ± 0.11) x 10" 3 

Ouchi et al. (2004) i' < 24.8 6.1 7.9±2 7 5.3±^;f (2.2 ± 0.6) X 10~ 4 

1.1 o cr+0.6 / Q 9 I -, 1n \ v 1n -4 



Ouchi et al. (2004) i' < 25.3 2.6 5 ^-io 3 - 5 -07 (9.2 ± 1.10) x 10 

(-0-2 r, Q + O.'l 

-0.2 z ' a -0.1 



Ouchi et al. (2004) i' < 26.0 1.7 4.1±°;2 2.9+°;J (4.9 ± 0.30) x 10~ 3 



Table 7. Summary of correlation function parameters, bias and number densities. The bias is calculated as b = (Jg/rr^ using linear 
theory and a power spectrum via n = 1 and erg = 0.9. 



6.1 The Masses and Environments of Bright z ~ 4 
LBGs 

In Figure 1131 we plot bias versus comoving space density 
for galaxies at z ~ 4. The observed space densities are 
calculated in Section 14.31 For the ODT bright sample of 
1 < 24.0 and the faint sample of i! < 24.5, we obtain 
bias values of 8.4 and 8.1, respectively. The corresponding 
space densities, calculated using the same selection function 
used for the correlation function inversion, are 2.23 x 10 
and 8.86 x 1 0~ 5 fej nn Mp c~ 3 . We also show points for the 
lOuchi et all <2004[) and lOuchi et all (l200ll) results, using 
their estimated effective volume and the reported numbers 
of LBGs for each limiting magnitude (with estimated uncer- 
tainties). 

The line in Figure 1131 depicts the correspondence 
between linear bias and comoving space density for dark 
matter halos at z = 4 (based on Sheth & Tormen 1999; 
see also Somerville et al. 2004). Smaller space densities 
correspond to more rare overdensities, and therefore to 
higher minimum dark matter halo masses (as shown). 
This is the relation that ACDM predicts that galaxies 
would follow, if there were only one galaxy in each dark 
matter halo, which is a useful simplification for check- 
ing trends, but not how galaxies tend to exist. In cases 
where there may be varying numbers of galaxies per halo, 
particularly likely if these are starbursting galaxies, the 
'halo occupation' formalism is a useful parametrisation. 
Here, above some threshold minimum mass, we assume 
there is a function N(M > M m i n \a, M\) that describes 
both the internal mass- function slope (a), and the mass 
of a dark matter halo that typically hosts one galaxy 
(Mi; therefore, there may statistically be M > M m in 
halos that have no resident galaxies). This prescription 
is found to describe galaxi es at both z ~ very well 
jMarinoni fc Hudson! 120021: Ivan den Bosch. Yang, fc Mol 
20031: I tlagliocchetti & Porcianil 120031) an d z ~ 3 (e.g. 
Wechsler et all l200ll : iBullock et al.l l2002t iKravtsov etall 
20041) . 

A detailed exploration of the occupation statistics of 
LBGs implied by our measurements is beyond the scope 
of this paper. It is useful, however, to turn to the one- 
galaxy-per-halo curve, and note two items. First, the data 
span a large dynamic range of space densities, and yet fol- 
low the same general trend as the haloes. This suggests 




ltr 6 10- 5 Q.0001 0.001 0.01 

n (h 3 Mpc- 3 ) 



Figure 13. The relationship between galaxy number density and 
the linear bias parameter, for a simple galaxy halo occupation 
function with a = (i.e one galaxy per halo) is shown by the 
solid black line. The numbers marked on the line correspond to 
the dark matter halo masses hosting the observed galaxies in this 
model. Masses are written as logio(M) in units of h~ 1 M©. Open 
stars represent ODT data pr esented in this pap er. Open pen- 
tagons represent the results of[Ouchiet ajj l 200ll) . Filled squares 
correspond to data from IO*ucn^et^lTT2u04T) ~See Table for a 
summary of these values. 

that the generic b/n correlation predicted by ACDM is re- 
flected in the data. Second, the mass-scale of the ODT 
LBG sample, M m i„ ~ few x 10 12 /t- _1 Mq, begs the ques- 
tion whether these naturally connect with later galaxy pop- 
ul ations known to be in s i milarl y massive environments. 
In iMoustakas fc Somervillel (|2002j), the possible clustering 
evolution between massive galaxy populations at z ~ 0, 
1.2, and 3 were explored in the context of simple ACDM- 
motivated models. The z ~ 3 galaxy population charac- 
teristics were drawn from the literature of largely-sub-L» 
samples, and were shown to not have a natural connec- 
tion with the lower-redshift (z ~ 1.2) populations, which 
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are plausibly connected with the most massive (elliptical- 
galaxy-type) environments today. The z ~ 1.2 (ERO) pop- 
ulation's linear bias and comoving space density, when ex- 
trapol ated to z ~ 4 (under the 'con stant minimum mass' 
model; iMoustakas fc Somervillell2002l) are expected to have 
b ~ 8 and n w 10 _5 /i 10 o Mpc -3 - values remarkably close to 
those measured in the super-L» ODT LBG sample. 

6.2 Significance of Results 

Thus far the quoted errors on our measured ro and bias 
have only included 'random' errors based on the best fit to 
the angular correlation function. In Section [4.21 we derived 
our redshift distribution based on an analysis of theoretical 
colours of high redshift galaxies. We also considered a range 
of possible median redshifts and found that this led to a 
±10% uncertainty on the derived value for ro. Combining 
this error in quadrature with the measured statistical errors 
leads to r = 11.4j j'° and r = 11.8 jjj '?. This is a difference 
from the results of lOuchi et alJ (1200111 at the 3.7-<r level for 
the faint sample and 2.1-cr for the bright sample. 

In Section 15.21 the possibility of broader and narrower 
redshift distributions than the one we assume was also dis- 
cussed, leading to possible increases in ro of up to 40%, 
and decreases in ro of up to 20%. Using a +40%/— 20% un- 
certainty for the redshift distribution gives values of ro = 
II.4I3 Q and ro = 11.8j;|j'g for the faint and bright samples 
respectiv ely. This would be enough to reduce the difference 
from the lOuchi et alJ (1200 ll) results to the 2.7-a and 1.9-cr 
level. 

We also noted in Section lo.ll that the effect on contami- 
nation on our LBG sample is likely to dilute the correlation 
function, and any correction that might be applied would 
lead to an increase in ro, further strengthening any rela- 
tionship between luminosity and bias. 

6.2.1 Cosmic Variance 

Since the LBG population is known to be strongly clustered, 
it is clear that the number density of objects will vary from 
field to field. It is therefore important that a large enough 
volume of sky is surveyed so that this 'cosmic variance' does 
not significantly bias the measured number density of ob- 
jects relative to the overall cosmic average. In order to esti- 
mate the size of such an effect o n this data set we use the 
results of lSomerville et all (120041) who compute the variance 
of dark matter from linear theory as a function of redshift. 

Based on the estimated effective volume at z ~ 4 cov- 
ered in this survey (3 x 10 4 Mpc 3 , assuming h = 0.7), the 
variance in the mean d ark matter density is es timated to be 
15% (see Figure 3b in ISomerville et aljEooi) . How cosmic 
variance relates to an uncertainty in number density of a 
population depends on how biased that population is rela- 
tive to the overall dark matter distribution. Assuming the 
bias measured in Section r5.3l is correct, then the uncertainty 
in the number density of LBGs due to cosmic variance can 
be estimated as <r cv = 8.1 x 0.15 = 1.2. Therefore, in Figure 
13 there is an additional uncertainty in the estimated space 
density due to the sample size (volume) and cosmic variance 
associated with this. 

Could fluctuations in number density caused by cos- 
mic variance produce the observed difference in clustering 



strength between the ODT and fainter samples? This is dif- 
ficult to test since estimates of the size of cosmic variance 
effects from linear theory require prior knowledge of the re- 
lationship between the galaxy population being studied and 
the underlying dark matter distribution (i.e. bias). However, 
if we start with the null hypothesis that the intrinsic bias 
of the O DT LBG sample is actually the same as that mea- 
sured bv lOuchi et alJ (1200 if) , then the uncertainty in num- 
ber densit y due to cosmic varian ce, a cv = 2.6 x 0.15 = 0.39 
( following [Somerville et al.l2004|) . If we have underestimated 
the intrinsic number density due to cosmic variance, and 
assume that £ tx l/(n 2 ) , then the 'true' number den- 
sity could indeed be larger, and the correlation length (and 
hence bias) could be smaller 5 . Taking a cv — 0.39, and as- 
suming a 2-cr fluctuation, implies an upper limit on the 
number density of 1.58 xlO~ 4 fe 3 n n Mpc -3 . If this cor- 
responds to the lOuchi et al.l i200lfl correlation length of 
ro = 3.2 h^QQ Mpc, then our measured value for the number 
density (8.86x 10" 5 /i 100 Mpc" 3 ) would scale r to 6.1 h^ 
Mpc which is inconsistent with our measured values (in fact 
a 5 .5-g fluctuat i on in number density is required to scale 
the lOuchi et alJ i200lfi value to 11.4 h^ Mpc). Variations 
in number density due to cosmic variance are too small to ex- 
plain the difference in clustering strength between the ODT 
and fainter samples. However, it is clear that wide field sur- 
veys at least as large as the ODT are required to measure 
the clustering properties of the bri ghtest object s . We also 
note that the independent results of lOuchi et alJ (1200411 are 
consistent with the trend of increasing clustering strength 
with brighter magnitudes at z ~ 4. 

6.3 Small Scale Clustering and Close Pairs 

In addition to a lu minosity dependent bias, 
IPorciani fc Giavaliscol |2002) provide evidence from a 
fairly bright sample of z ~ 3 LBGs that there is also a 
significant scale-dependent bias. In their data set there 
appears to be a lack of power in the angular correlation 
function ( and therefore a lower bias) on scales less than 
30 arcsec. iBullock et al.l (I2002T) also demonstrate (where 
redshifts are available) how close pair statistics can be used 
within the halo occupation function formalism to place 
constraints on differen t mod els for the occupation function. 
IPorciani fc Giavaliscol (I2002T) interpret the 'break' in their 
correlation function as evidence that the dark matter haloes 
have a fairly large size and mass, which is more consistent 
with one-galaxy-per-halo occupation functions. However it 
should be noted that our analysis (and that of most other 
LBG clustering studies) has assumed a simple power law 
for the c orrelation function wit h 7 = 1.8. In addition, the 
results of lKravtsov et all (I2004T) (which come from iV-body 
simulations) indicate that the correlation function may 
actually steepen on the smallest scales, (which in turn 

5 This approximation (£ oc l/(n 2 )) assumes that Gaussian statis- 
tics apply, which is not the case here. A full calculation of the ef- 
fects of cosmic variance requir es knowledge of higher order cluster- 
ing statistics. Simulations by Fouca ud et alJ d2003Tl suggest that 
for a similar bright sample but at z ~ 3, the size of the cosmic 
error is likely to be smaller than the measured Poisson errors. 
However, this rough calculation provides a useful measure of the 
potential size of the 'cosmic error'. 



© 0000 RAS, MNRAS 000, 000-000 



14 Allen et al. 



w ould lead to ov e r-esti mates of ro for LBGs). The models 
of lHamana et all J2004T) indicate that the observed angular 
correlation function is well fitted by a two-component power 
law with a steep componen t dominating on small scales, 
although lOuchi et al] (120041) fit the same data with single 
component model with a relatively steep (/3 = 0.9) slope. 

In the ODT data presented here we find no close pairs 
on scales less than 10 arcsec in the i' < 24.5 sample and none 
less than 40 arcsec in the brighter i' < 24.0 sample. Unfor- 
tunately, since only one or less pairs would be required to 
fit the best-fitting power law on these scales, it is not pos- 
sible to determine whethe r or not this constitutes evid ence 
to support the results of iPorciani fc Giavaliscol i2002f) . or 
indeed to determine whether a steeper slope is more appro- 
priate on small scales. However an analysis of the full ODT 
survey should yield a good measurement of the small-scale 
clustering of bright LBGs, and help assess the significance of 
any lack of power on small-scales in the angular correlation 
function. 



7 CONCLUSIONS 

The main results presented here can be summarised as fol- 
lows: 

• The clustering of LBGs at z ~ 4 is well approximated 
by a power law w{9) = A W Q~^ . Fixing the slope to that of 
local galaxies (fi — 0.8), provides a good fit to the data with 
A w — 15.39 arcsec ' 8 the best amplitude for i' < 24.5 and 
A m = 16.40 arcsec ' 8 for i' < 24.0. However we note that 
there is evidence to suggest that j3 — 0.8 may not be a valid 
assumption to make at high redshift, and a shallower slope 
may be more appropriate. 

• Using a reasonable fiducial redshift distribution to char- 
acterise the selection function for LBGs, the angular cor- 
relation function can be deprojected to obtain the spatial 
correlation length, ro. For our fainter sample {i! < 24.5) 
we obtain ro = 11.41 \'gh^ 00 Mpc. Using a slightly brighter 
sample (i 1 < 24.0) we obtain a similar correlation length 
of ro = 11.8j;4'J/i^ o Mpc. Comparing these results to the 
clustering properties of dark matter, we obtain linear bias 
values of 8.ll^2 and 8.4+2'g respectively. 

• When compared with fainter surveys the bias and cor- 
relation lengths seen in the ODT are clearly significantly 
larger. We interpret this as evidence for a luminosity de- 
pendent bias for LBGs, as predicted by some semi-analytic 
models. The ODT bias values are shown to be consistent 
with a simple model using a galaxy occupation function de- 
scribing one observable galaxy per dark matter halo. Bright 
(super— L*) LBGs seem to be more biased tracers of mass 
than fainter (sub— L*) ones, suggesting a relationship be- 
tween halo mass and instantaneous star formation rate. 

• The population of sub— L» LBGs are unlikely to be the 
progenitors of massive galaxies at z ~ 1 and z ~ 0. How- 
ever, the extremely bright, super— L* population presented 
in this paper have biases, number densities and halo masses 
(Mmin ~ few x 10 12 /i _1 M@) that are consistent, in a simple 
model, with them being potential progenitors of the most 
massive galaxies. 

The data presented here are only a small sample of 
the full ODT survey. With an extended sample containing 



> 1000 LBGs over a much wider area it should be possible to 
place stronger constraints on the clustering properties of this 
bright sample of high redshift galaxies, and address issues 
such as their scale dependent bias. In additio n, other surveys 
curre ntly under way such as the NDW FS lljannuzi fc Devi 
I1999D . GOO DS JPickinson et alll2003h . the VIRMOS-VLT 
deep survey i Le^evre^^lJJW9a) , the CFHT Legacy sur- 
vey 6 , SXDS iKodama et al]l2004ll . and COSMOS 7 , should 
provide good measurements of LBG clustering probing 
(when combined) a large dynamic range in luminosity and 
redshift. This should allow tighter constraints on parameters 
such as the galaxy occupation function, and the scale and 
luminosity dependent bias for LBGs; thus providing a bet- 
ter understanding of the relationship between LBGs, other 
high-z populations, and galaxies today. 



ACKNOWLEDGEMENTS 

The Isaac Newton Telescope is operated on the island of 
La Palma by the Isaac Newton Group in the Spanish Ob- 
servatorio del Roque de los Muchachos of the Instituto de 
Astrofisica de Canarias. PDA, CEH, and CAB acknowl- 
edge the support of PPARC Studentships. ECM thanks the 
C.K. Marr Educational Trust. This work was supported by 
the PPARC Rolling Grant PPA/G/O/2001/00017 at the 
University of Oxford. LAM acknowledges support from the 
SIRTF Legacy Science Program, provided through an award 
issued by the Jet Propulsion Laboratory, California Institute 
of Technology under NASA contract 1407. Thanks to both 
anonymous referees whose comments helped to improve this 
paper. 



REFERENCES 

Adelberger K. L., Steidel C. C, 2000, ApJ, 544, 218 
Adelberger K. L., Steidel C. C, Giavalisco M., Dickinson 

M., Pettini M., Kellogg M., 1998, ApJ, 505, 18 
Adelberger K. L., Steidel C. C, Shapley A. E., Pettini M., 

2003, ApJ, 584, 45 
Arnouts S., Cristiani S., Moscardini L., Matarrese S., 

Lucchin F., Fontana A., Giallongo E., 1999, MNRAS, 310, 

540 

Arnouts S., et al., 2002, MNRAS, 329, 355 
Bahcall J. N., 1986, ARAA, 24, 577 

Baugh C. M., Cole S., Frenk C. S., Lacey C. G., 1998, ApJ, 
498, 504 

Baugh C. M., Gardner J. P., Frenk C. S., Sharpies R. M., 

1996, MNRAS, 283, L15 
Bell E. F., et al., 2004, ApJ, 608, 752 
Bertin E., Arnouts S., 1996, A&AS, 117, 393 
Brainerd T. G., Smail I., 1998, ApJL, 494, L137 
Bullock J. S., Wechsler R. H., Somerville R. S., 2002, MN- 
RAS, 329, 246 

Coleman G. D., Wu C.-C, Weedman D. W., 1980, ApJS, 
43, 393 

de Lapparent V., Galaz G., Bardelli S., Arnouts S., 2003, 
A&A, 404, 831 

6 http: //www.cfht.hawaii.edu/Science/CFHTLS 

7 http: //www. astro. caltech.edu/~cosmos 



© 0000 RAS, MNRAS 000, 000-000 



Clustering of Bright Lyman Break Galaxies in the ODT Survey 15 



Dickinson M., Giavalisco M., The Goods Team, 2003, in 
The Mass of Galaxies at Low and High Redshift. Pro- 
ceedings of the ESO Workshop held in Venice, Italy, 24-26 
October 2001, p. 324., p. 324 

Efstathiou G., Bernstein G., Tyson J. A., Katz N., 
Guhathakurta P., 1991, ApJL, 380, L47 

Fan X., et al., 2001, AJ, 121, 54 

Fioc M., Rocca-Volmerange B., 1997, A&A, 326, 950 
Foucaud S., McCracken H. J., Le Fevre O., Arnouts S., 

Brodwin M., Lilly S. J., Crampton D., Mellier Y., 2003, 

A&A, 409, 835 
Gardner J. P., 1998, PASP, 110, 291 
Giavalisco M., Dickinson M., 2001, ApJ, 550, 177 
Giavalisco M., Steidel C. C, Adelberger K. L., Dickinson 

M. E., Pettini M., Kellogg M., 1998, ApJ, 503, 543 
Glazebrook K., Peacock J. A., Collins C. A., Miller L., 

1994, MNRAS, 266, 65 
Groth E. J., Peebles P. J. E., 1977, ApJ, 217, 385 
Hamana T., Ouchi M., Shimasaku K., Kayo I., Suto Y., 

2004, MNRAS, 347, 813 
Hill G. J., Rawlings S., 2003, New Astronomy Review, 47, 

373 

Jannuzi B. T., Dey A., 1999, in ASP Conf. Ser. 193: The 

Hy-Redshift Universe: Galaxy Formation and Evolution 

at High Redshift, p. 258 
Kaiser N., 1984, ApJL, 284, L9 
Kodama T., et al., 2004, MNRAS, 350, 1005 
Kravtsov A. V., Berlind A. A., Wechsler R. H., Klypin 

A. A., Gottlober S., Allgood B., Primack J. R., 2004, ApJ, 

609, 35 

Landolt A. U., 1992, AJ, 104, 340 
Landy S. D., Szalay A. S., 1993, ApJ, 412, 64 
Le Fevre O., et al., 1998, in Wide Field Surveys in Cos- 
mology, 14th IAP meeting held May 26-30, 1998, Paris. 
Publisher: Editions Frontieres. ISBN: 2-8 6332-241-9, p. 
327., p. 327 

Ling E. N., Barrow J. D., Frenk C. S., 1986, MNRAS, 223, 
21P 

MacDonald E. C, et al., 2004, MNRAS, 352, 1255 
Madau P., 1995, ApJ, 441, 18 

Magliocchetti M., Porciani C, 2003, MNRAS, 346, 186 
Marinoni C, Hudson M. J., 2002, ApJ, 569, 101 
McCracken H. J., Le Fevre O., Brodwin M., Foucaud S., 
Lilly S. J., Crampton D., Mellier Y., 2001, A&A, 376, 756 
Moustakas L. A., Somerville R. S., 2002, ApJ, 577, 1 
Norberg P., et al., 2002, MNRAS, 332, 827 
Olding E. J., 2002, D.Phil. Thesis, University of Oxford. 
Ouchi M., et al., 2001, ApJL, 558, L83 
Ouchi M., et al., 2004, ApJ, 611, 685 

Peebles P. J. E., 1980, The large-scale structure of the uni- 
verse. Princeton University Press, 1980. 435 p. 

Pettini M., Shapley A. E., Steidel C. C, Cuby J., Dickinson 
M., Moorwood A. F. M., Adelberger K. L., Giavalisco M., 
2001, ApJ, 554, 981 

Pickles A. J., 1998, PASP, 110, 863 

Porciani C, Giavalisco M., 2002, ApJ, 565, 24 

Postman M., Lauer T. R., Szapudi I., Oegerle W., 1998, 
ApJ, 506, 33 

Robin A., Creze M., 1986, A&A, 157, 71 

Robin A. C, Haywood M., Creze M., Ojha D. K., Bienayme 
O., 1996, A&A, 305, 125 

Roche N., Eales S. A., 1999, MNRAS, 307, 703 



Shapley A. E., Steidel C. C, Adelberger K. L., Dickinson 
M., Giavalisco M., Pettini M., 2001, ApJ, 562, 95 

Shapley A. E., Steidel C. C, Pettini M., Adelberger K. L., 
2003, ApJ, 588, 65 

Somerville R. S., Lee K., Ferguson H. C, Gardner J. P., 
Moustakas L. A., Giavalisco M., 2004, ApJL, 600, L171 

Somerville R. S., Primack J. R., Fabcr S. M., 2001, MN- 
RAS, 320, 504 

Steidel C. C, Adelberger K. L., Dickinson M., Giavalisco 

M., Pettini M., Kellogg M., 1998, ApJ, 492, 428 
Steidel C. C, Adelberger K. L., Giavalisco M., Dickinson 

M., Pettini M., 1999, ApJ, 519, 1 
Steidel C. C, Giavalisco M., Pettini M., Dickinson M., 

Adelberger K. L., 1996, ApJL, 462, L17+ 
Steidel C. C, Hamilton D., 1992, AJ, 104, 941 
Stevens R., Lacy M., 2001, MNRAS, 325, 897 
van den Bosch F. C, Yang X., Mo H. J., 2003, MNRAS, 

340, 771 

Wechsler R. H., Somerville R. S., Bullock J. S., Kolatt T. S., 
Primack J. R., Blumenthal G. R., Dekel A., 2001, ApJ, 
554, 85 



© 0000 RAS, MNRAS 000, 000-000 



